Spike Timing Dependency Plasticity (STDP)

My Last Second
I'm not talking about the last second before I die. I have no knowledge of that experience and no way to report it when it happens. I'm talking about the last second you and I experienced while reading this very sentence. What facility in your brain gives this narrow, less than one-second view of the world around you? Where is it physically located, and how does it work?

The search for this function is the culmination of my quest to understand the process in the brain that allows us to participate in the moment.

I have a strong feeling that STDP is closely tied to the last second but after a long review of papers on the field, I find that the experts are as confused as I am, not a good sign for success. A good summary is by Feldman 'The Spike-Timing Dependence of Plasticity'.

It turns out that the basis for STDP is backpropagation (bAP) - that is, when the neuron spikes and sends its output down the axon chain, it also produces a second spike that propagates back into the dendrite tree. This seems to be what effectuates STDP in the synapses.

I found a way to modify the simulator to work with backpropagation and am moving forward with it. This solution involves sending Windows messages to various body parts during the simulation. You can see the progress on this solution in the Notes Volume 1 Chapter 5 and below.

It is amateurs who have one big bright beautiful idea that they can never abandon. Professionals know that they have to produce theory after theory before they are likely to hit the jackpot.
-Francis Crick

My Theasus:

Without self-organization, the brain would start but soon stop - Locked down at an optimum condition. To keep the brain working, you need a little noise. Enough to jolt self-satisfied neurons out of their complacency and into action but not so much that the signal gets lost in the noise.

Aside from a little noise, you need some way that the brain can organize itself into a workable whole. It cannot be done by some sort of brain-within-brain composite brain that makes the final decisions based on inputs from all other parts of the brain. That duality requires that the 'inside brain' is made out of some stuff that is 'not of this world'.

I believe that STDP may be a primary mechanism enabling the self-organization required for conscious neural dynamics.

A good look at the field is presented in STDP an Overview [Markram]

With all that in mind lets dive into STDP details.

Notes from Song

These notes were taken from the sentinel paper on STDP from Song, Mill and Abbott in March 2000 (available in the reference section)

It must not have a global arbitrator involved. Local competition only
Competition can also arise locally due to synaptic modification mechanisms that equilibrate at a per-set level of total synaptic innervation or activity [9 ]
Spike timing provides a mechanism that can lead to competitive Hebbian learning without requiring global intracellular signaling, pre-set activity, or synaptic efficacy levels.
Long-term strengthening of synapses occurs if presynaptic action potentials precede postsynaptic firing by no more than about 50 ms.

The STDP function plotted.
For example a spike comming in 20 mSec 'early' will improve that synapse by about 0.18 %. A 'late' incomming spike of 20 mSec will decrease that synapse by about -0.18%

The basic function used in this simulation:

where τ+ and τ− determine the ranges of pre- to postsynaptic interspike intervals over
which synaptic strengthening and weakening occur. A+ and A− determine the maximum
amounts of synaptic modification, which occur when delta t is close to zero.

The temporal windows in the range of tens of milliseconds.

Simulator Summary

The simulator’s STDP implementation combines:

Real dendritic propagation delays
Soma-centered timing evaluation
Back-propagating action potentials
Additive STDP learning curves
Nearest neghbor selection of AP for evaluation
End-of-run synaptic evaluation

to produce a biologically-inspired learning system in which synapses self-adjust according to their causal contribution to neuronal firing.

Basic Signal Flow

The simulator models the signal path as :

Synapse → Dendrite Tree → Soma → Axon

Each synapse injects current into a dendrite branch.
The signal then travels through the dendritic tree toward the Soma with realistic propagation delays.

When the Soma membrane potential reaches threshold, the Soma generates an Action Potential (AP).

The AP then:

Travels forward down the Axon
Travels backward into the dendritic tree as a Back-Propagating Action Potential (bAP)

The bAP is the critical signal used for STDP learning.

Why Timing Matters

A synapse should only be strengthened if its activity actually contributed to causing the Soma spike.

To determine this, the simulator compares:

The time the synapse released neurotransmitter
The time the Soma generated an AP

However, dendritic signals require time to travel.

A synapse located far from the Soma fires earlier than one close to the Soma if both are to influence the same AP.

Therefore the simulator evaluates timing in the Soma’s reference frame.

Dendritic Transit Delay Compensation

Each dendrite branch has an associated signal propagation delay. When a synapse fires:

Synapse release time + dendritic transit delay

This determines when that signal actually influences the Soma. Likewise, when the Soma fires an AP, the simulator sends a bAP message backward through the dendritic tree carrying:

Soma AP time
Cumulative reverse transit delay

This allows every synapse to determine Δt = SomaSpikeTime − ffectiveSynapseArrivalTime

where:

EffectiveSynapseArrivalTime =
SynapseReleaseTime + ForwardDendriticDelay

STDP Learning Rule

The simulator uses the classic additive STDP relationship described by Song, Miller, and Abbott (2000). The learning rule depends on the sign and magnitude of Δt.

In the simulator, synaptic strength can be represented in several ways, including:

Number of effective synaptic connections
Current amplitude contributed by the synapse
Total synaptic output current

Increasing synaptic weight causes the synapse to inject more current into the dendrite, making future Soma spikes more likely. Decreasing weight reduces its future influence.

The STDP mechanism allows the simulated neuron to learn temporal correlations automatically.

Over time:

Useful synapses become stronger
Irrelevant synapses weaken
Frequently co-active pathways self-organize
The neuron becomes selectively responsive to repeating input patterns

This creates an emergent learning behavior similar to biological neural tissue.

End-of-Run Evaluation

The simulator does not immediately alter synaptic weights during the run. Instead, each synapse maintains a history of:

Neurotransmitter release times
Local dendritic timing information
Received bAP timing information

At the end of the simulation run, the synapse evaluates all timing relationships and computes its total weight adjustment.

This approach provides:

Stable evaluation
Easier visualization
Full event history analysis
Support for nearest-neighbor timing evaluation

Back propagation

The simulator explicitly models bAPs because biological neurons use them as a feedback signal to indicate the Soma fired. Without a bAP, distant synapses would have no reliable way to determine whether they contributed to generating the action potential.

The bAP therefore acts as the neuron’s internal teaching signal.

Long-Term Potentiation (LTP)

If the synapse helped trigger the Soma spike Δt > 0 the synapse is strengthened. This means the Synapse fired BEFORE the Soma AP and likely contributed causally to the spike. The closer the timing, the larger the strengthening effect.

Simulation of LTP

Long-Term Depression (LTD)

If the synapse fired after the Soma AP Δt < 0 The synapse is weakened. This indicates the synapse was poorly correlated with causing the spike.